Retention time alignment algorithms for LC/MS data
نویسنده
چکیده
The goal of proteomics is to understand the individual proteins and their roles in larger biological systems. The study of the human proteome is a crucial component in fields such as drug development and biomarker discovery. Proteomics employs the liquid-chromatography mass-spectrometry technique to identify and quantify the proteins of a complex biological sample. The primary obstacle in comparative proteomics is the alignment of the LC-MS outputs with respect to retention time. In this study, linear and loess regression algorithms presented by Podwojski [2009] are constructed and implemented in Stata for a randomly generated data set. The linear regression approach is only able to accommodate linear shifts in retention time, whereas loess regression allows for the correction of non-linear shifts. In the absence of a deterministic model, a new approximation for Akaike's information criterion is introduced to determine the bandwidth of each loess regression. The data is then corrected according to both regression algorithms. The effective alignment of each is analyzed to demonstrate the advantage of algorithms that adjust non-linear shifts.
منابع مشابه
PWA - 138 Statistical Significance in LC-MS based Label-free Protein Quantification Analysis
Label-free MS-based quantification of peptides from LC-MS data is a valuable complement to MS-based quantification technologies such as SILAC, ICPL, or gel based quantification. However, statistically valid labelfree quantification of peptides and proteins from a digest of a proteomics sample in up to hundreds of LC-MS experiments is a challenge, as it requires excellent sensitivity, mass accur...
متن کاملRetention time alignment algorithms for LC/MS data must consider non-linear shifts
MOTIVATION Proteomics has particularly evolved to become of high interest for the field of biomarker discovery and drug development. Especially the combination of liquid chromatography and mass spectrometry (LC/MS) has proven to be a powerful technique for analyzing protein mixtures. Clinically orientated proteomic studies will have to compare hundreds of LC/MS runs at a time. In order to compa...
متن کاملSIMA: Simultaneous Multiple Alignment of LC/MS Peak Lists
MOTIVATION Alignment of multiple liquid chromatography/mass spectrometry (LC/MS) experiments is a necessity today, which arises from the need for biological and technical repeats. Due to limits in sampling frequency and poor reproducibility of retention times, current LC systems suffer from missing observations and non-linear distortions of the retention times across runs. Existing approaches f...
متن کاملMulti-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards
MOTIVATION Liquid chromatography-mass spectrometry (LC-MS) has been widely used for profiling expression levels of biomolecules in various '-omic' studies including proteomics, metabolomics and glycomics. Appropriate LC-MS data preprocessing steps are needed to detect true differences between biological groups. Retention time (RT) alignment, which is required to ensure that ion intensity measur...
متن کاملA statistical method for chromatographic alignment of LC-MS data.
Integrated liquid-chromatography mass-spectrometry (LC-MS) is becoming a widely used approach for quantifying the protein composition of complex samples. The output of the LC-MS system measures the intensity of a peptide with a specific mass-charge ratio and retention time. In the last few years, this technology has been used to compare complex biological samples across multiple conditions. One...
متن کامل